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ABSTRACT 

Triangles are an important building block and distinguish- 
ing feature of real- world networks, but their structure is still 
poorly understood. Despite numerous reports on the abun- 
dance of triangles, there is very little information on what 
these triangles look like. We initiate the study of degree- 
labeled triangles — specifically, degree homogeneity versus 
heterogeneity in triangles. This yields new insight into the 
structure of real- world graphs. We observe that networks 
coming from social and collaborative situations are domi- 
nated by homogeneous triangles, i.e., degrees of vertices in 
a triangle are quite similar to each other. On the other 
hand, information networks (e.g., web graphs) are domi- 
nated by heterogeneous triangles, i.e., the degrees in trian- 
gles are quite disparate. Surprisingly, nodes within the top 
1% of degrees participate in the vast majority of triangles in 
heterogeneous graphs. We also ask the question of whether 
or not current graph models reproduce the types of triangles 
that are observed in real data and showed that most models 
fail to accurately capture these salient features. 

1. INTRODUCTION 

There is a growing interest in understanding the struc- 
ture, dynamics, and evolution of large scale networks. Ob- 
serving the commonalities and differences among real- world 
networks improves graph mining in many aspects ranging 
from community detection to generation of more realistic 
random graphs. 

A triangle is a set of three vertices that are pairwise con- 
nected and is arguably one of the most important patterns 
in terms of understanding the inter-connectivity of nodes in 
real graphs [13, 34]. Note that the community structure is 
closely tied to triangles, and the degree behavior of trian- 
gles is an integral part of this structure [30] . Whether these 
graphs come from communication networks, social interac- 
tion, or the Internet, the presence of triangles is indication 
of community behavior. In social networks, it is consid- 
ered highly probable that friends of friends will themselves 



be friends, thus forming many triangles. Social scientists 
have long observed the significance of triangles, as they are 
manifestations of specific interaction patterns [11, 27, 35, 7, 
17]. For example, in friendship networks, a significant dif- 
ference among degrees in triangle vertices might indicate an 
anomaly such as existence of a spam bot [3]. 

In this paper, we take a closer look at the structure of 
triangles, specifically, the degrees of the triangle vertices. 
Consider the two triangles in Figure 1. How are the degrees 
of the three vertices related? Do these represent fundamen- 
tally different types of relationships and so appear in differ- 
ent sorts of networks? When we look at real- world networks, 
we may ask if there is a high incidence of degree homogene- 
ity, wherein vertices of similar degree come together to form 
triangles? Or do triangles tend to show degree heterogeneity, 
i.e., connecting vertices of disparate degree? 




Figure 1: Homogeneity versus heterogeneity in triangles. 
The triangle on the left is homogeneous since degrees of all 
its vertices are close, and the one on the right is heteroge- 
neous due to a mixture of high and low degree vertices. 

1.1 Background and Previous Work 

The study of triangles is quite prevalent in the social 
sciences community. Coleman [11] and Fortes [27] used 
the clustering coefficient to predict the likelihood of going 
against social norms. Welles et al. studied the variance of 
clustering coefficients for different demographics groups and 
found that adolescents are more likely to have connected 
friends than adults and are even more likely to terminate 
connections with friends that are not connected to their 
other friends [35]. Burt underlined the importance of nodes 
that could serve as a bridge between various communities [6] 
and tied this to the number of open triangles in a vertex [7] . 
Lawrence [17] observed that there are powerful homophily 
effects in who connects to whom in organizational environ- 
ments, who people are aware of, or whose opinions people 
attend to. Bearman, Moody, and Stovel studied homogene- 
ity of partners in a romantic network of adolescents [2] . One 
noteworthy observation in this study was the similarity on 



partnership experience, which corresponds to similarity in 
vertex degrees. 

The notion of describing graph structure based on the fre- 
quency of smah patterns such as triangles has been proposed 
under different names such as motifs [5, 24], graphlets [28], 
and structural signatures [12]. Triangle counts form the ba- 
sis for community detection algorithms in [4, 14]. They have 
also served as the driving force for generative models [30, 
34]. Eckman and Moses [13] interpreted the clustering co- 
efficients as a curvature and showed that connected regions 
of high curvature on the WWW characterized common top- 
ics. Directed triangles are important motifs for comparing 
and characterizing graphs [5, 12, 22, 23, 28]. For graph 
databases, exploiting frequent patterns have also been pro- 
posed for efficient query processing [32, 37]. 

The frequency of triangles is often measured using the 
clustering coefficient, as defined by Watts and Strogatz [34] . 
We first establish some notation. Consider an undirected 
graph G with n vertices. Let dj denote the degree of node j 
and tj denote the number of triangles containing node j. If 
we define a wedge to be a path of length 2, then the number 
of wedges centered at node j is (^2')- Now we can define 
various clustering coefficients. The clustering coefficient of 
vertex j, Cj, is defined as the number of triangles incident 
to j divided by the number of wedges centered at j, i.e., 
Cj = ^j/i^i)' average of clustering coefficients across 

all vertices (called the local clustering coefficient) is defined 
as C = ^ Cj . Let Vd — {j dj — d} be the set of vertices 
of degree d. We define the clustering coefficient of degree d 

to beC7d= ^Ei6V,C?J - 

The (global) clustering coefficient, also known as the tran- 
sitivity, is 

^ _ 3 X total number of triangles _ ^j tj 
total number of wedges (^^^^ 

For random graphs with no structure, C and C values are 
extremely small [26]. 

Most of the studies on degree-based similarity is based on 
assortativity, which was introduced by Newman [25]. Vari- 
ous studies have been conducted on the assortativity (or lack 
thereof) of real graphs [15, 21, 36]. However, Newman's as- 
sortativity measure is misleading to classify networks with 
heavy-tailed degree distributions because it produces either 
neutral or negative assortativity (disassortativity) values for 
most of the large scale networks as shown in Table 1 with r 
values. 

Most relevant to our work is that of Tsourakakis [33], 
which observed various power laws in triangle behavior. This 
work focused on triangle counts for nodes (how many trian- 
gles a node participates). He finds that the average number 
of triangles per vertices of a given degree follows a power- 
law distribution and the slope of the degree-triangle plot has 
the negative slope of the degree distribution plot of the cor- 
responding graph. It is argued that low degree nodes form 
fewer triangles than higher degree nodes. Our analysis shows 
that while this is certainly true for social networks, it does 
not hold for information networks, such as the autonomous 
systems networks. 

1.2 Contributions 

Our contributions fall into two categories. Our first set 
of results comes from empirical studies of degree relations 



in triangles of real graphs. Then, we perform experiments 
on a variety of graph models to show their (in) ability to 
reproduce the behavior of real graphs. 

Triangle homogeneity vs heterogeneity: We take a 
collection of graphs from diverse scenarios (collaboration, 
social networking, web, infrastructure) and measure trian- 
gle degree relationships. We compute various correlations 
between degrees of vertices in triangles to understand the 
homogeneity nature of these triangles. 

Our experiments showed that graphs coming from social 
or collaborative scenarios are completely dominated by ho- 
mogeneous triangles. There are a few heterogeneous trian- 
gles. This may not be surprising from a sociological view- 
point, since like should attract like. But graphs coming from 
web, routing, or communication are dominated by heteroge- 
neous triangles. It is interesting that in communication or 
routing networks, majority of triangles are formed by the 
vertices within top 1% degrees. 

We observed that there is a high correlation between global 
clustering coefficient C and the homogeneity tendency of tri- 
angles. The higher C a network has, the stronger homogene- 
ity tendency the network has. 

We also showed that the triangles in networks have diverse 
degrees of vertices and there is a varying distance among 
triangle degrees. 

The triangle behavior of graph models: Our result 
can be stated quite succinctly. No existing graph model re- 
flects the homogeneous and heterogeneous triangle behavior 
together. Many standard graph models like Preferential At- 
tachment, Edge Copying, Stochastic Kronecker, etc. do not 
generate enough triangles and they cannot approximate the 
clustering coefficients of the real graphs [29] . The Chung-Lu 
[8] model cannot generate homogeneous graphs and cannot 
approximate the clustering coefficient of the high clustering 
coefficient networks. However, Chung-Lu is the only model 
imitates the networks with low clustering coefficient and it 
is enable to generate heterogeneous triangles. 

The Forest Fire [20] and BTER [30] models generate a 
reasonable number of triangles (especially incident to low 
degree vertices) but these triangles are extremely homoge- 
neous. Low degree vertices, when they participate in trian- 
gles, exclusively form triangles with other low degree ver- 
tices. This happens regardless of parameter choices, and is 
a fundamental property of these models. This shows that 
while they can qualitatively look like social and interaction 
networks, the behavior of, say, heterogonous networks can- 
not be reproduced by these models. 

2. REAL- WORLD TRIANGLE BEHAVIOR 
2.1 Data 

We analyze the degree relations among vertices of trian- 
gles on a diverse set of real- world graphs: collaboration net- 
work (ca-AstroPh), citation network (cit-HepPh), trust net- 
work (soc-Epinionsl), co-purchasing network (Amazon0312), 
autonomous systems (as-caida20071105), routing network 
(oregonl_010331), web network (web-Stanford), and com- 
munication network (wiki-Talk). All these graphs were ob- 
tained from the SNAP database [38]. In our studies, we have 
symmetrized the graphs by treating all edges as undirected; 
made each graph simple, by removing self loops and parallel 
edges; and did not use edge weights. Cohen's algorithm [10] 
was used to enumerate all triangles. 



Table 1: Properties of networks we analyzed. 
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In Table 1, we provide the following properties of the graphs 
we analyzed: N = number of nodes; E = number of edges; 
p — E/N (density); C — global clustering coefficient; C — 
local clustering; T — number of triangles; a — the power-law 
exponent, which is computed by fitting power-law distribu- 
tion to degree distribution plots of the graphs [9]; /^go and 
kqq^ respectively, are the 90th and 99th percentiles of de- 
gree of all nodes participating in triangles (i.e., we obtain 
all nodes participating in any triangle (each node is only 
counted once), put their degrees in a list, and then pick the 
99th percentile of the degree list); dmax = maximum degree; 
and r = assortativity value. 

In this paper, a network whose global clustering coeflfi- 
cient, C is greater than 0.01, is referred to as a high-C net- 
work, and as a low-C network otherwise. In Table 1, the 
first 4 graphs are high-C graphs, whereas the last 4 are low- 
C graphs. 

2.2 Analysis 

We analyze the degree similarity of triangle vertices by 
grouping the triangles according to their minimum degree 
vertex. We first present the notation. For t = 1, . . . , T, let 
<^min(t), dmid{t), and (imax(t) dcuotc the minimum, middle, 
and maximum degree of the t-th triangle. For instance, if 
the t-th triangle has vertices of degrees 5, 10, and 4, then 
dmin{t) = 4, dmid{t) = 5, and (imax(t) = 10. Define the set 
B{i) to be the set of all triangles whose minimum degree is 
z, i.e., 

B{i)^{teT\dmin{t)=i}. 

We may then define some average statistics for each set 
B{i). Define di{i), d2{i), and ^3(2) to be the median of min- 
imum, middle, and maximum degree, respectively, of trian- 
gles in B(i). In other words, 

di{i) — median { (imin(t) | t G B{i) } = 
d2{i) = median { dmid{t) \ t G B{i) } , 
dsii) = median { (imax(t) | t G B{i) } . 

For instance, if the triangles in B{2) were given by [2 2 3], 
[2 4 5], and [2 3 3]. Then, the di{2) = 2, ^2(2) = 3 and 
d3{2) =3. 

To compare the relations among triangle degrees, we plot 
the d2{i) and ^3(2) versus di{i) as in Figure 2 and call them 
degree-comparison plots. Note that in these and all other 
log-log and semi-log plots, we use the exponential binning 
which is a standard procedure to de-noise the data when 
plotting on logarithmic scale. 



2.3 Observations 

By considering the degree relations of the triangle vertices, 
we make the following observations. 

Observation 1: The global clustering coefficient is an 
indicator for the triangle degree relations. 

When we analyze Figure 2, we can see a clear relation 
between global clustering coefficient C and the type of tri- 
angles. In high-C networks, minimum, middle, and maxi- 
mum degrees of triangle vertices are close in value. While, 
in low-C networks, triangles are highly heterogeneous. Ob- 
serve how very small values of di{i) connect to quite large 
d2{i) or ^3(2). In Figure 2e, Figure 2f and Figure 2h low de- 
gree vertices are connecting to two high vertices {d2{i) and 
0^3 (0)- Web-Stanford (Figure 2g) is less structured and has 
a different tendency from the rest but it is still visible that 
low degree vertices are connecting to high d3{i) value. 

The average clustering coefficient C is not a very distin- 
guishing metric for our study. The global clustering coeffi- 
cient C shows wide variance and is a better indicative of the 
triangle behavior. 

Observation 2: There is a non-trivial gap between the 
minimum, medium, and maximum degrees of triangles. 

Even though the degrees of triangle vertices are similar 
to each other in high-C networks, still there is a non-trivial 
between di{i) and d2{i) and between d2(i) and d3(i) in Fig- 
ure 2. This observation tells us that inside the communities 
or clusters in real networks, there exist triangles with vary- 
ing degrees of vertices. 

Observation 3: The ratios among degrees of triangle ver- 
tices are small in high-C networks and large in low-C net- 
works. 

The ratios of triangle degrees provide valuable informa- 
tion to see the distinction between networks. For the t-th 
triangle, three degree ratios are deffned as follow: r2i(t) = 
'■31 (t) = and r32(i) = These ratios 

are computed for all the triangles separately and their av- 
erage is computed as f2i, fsi, and f32, respectively. Table 2 
lists the average ratios for all the networks. 

There is a clear distinction between high-C and low-C net- 
works. The average ratios are very small in high-C networks, 
which also supports the triangle homogeneity in high-C net- 
works. The average of f2i and f32 are close to each other, in 
other words, the middle degree is both close to the minimum 
degree and the maximum degree. 

The average degree ratios are signiffcantly large in low- 
C networks. Particularly, the ratio between the maximum 
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Figure 2: Triangle degree- comparison plots which compare the medians of minimum degree, di{i), the middle degree, d2{i) 
and the maximum degree ds^i) 



Table 2: The average of triangle degree ratios 
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Figure 3: Percentage of triangles such that c/max/c^min < 10 



and the minimum degree is very high. We can see that 
in as-caida20071105, oregonl_010331, wiki-Talk, middle and 
maximum degrees are close to each other but these degrees 
differ from the minimum degree significantly. Web- Stanford 
acts little different. In this graph, minimum and middle 
degrees are close to each other but the maximum degree is 
distant to both the minimum and the medium degrees. 

We have also looked at the percentage of homogeneous 
triangles, which we define triangles where fsi < 10. Fig- 
ure 3 shows that more than 90% of the triangles in high-C 
networks are homogeneous. 

Observation 4: In low-C networks, high degree vertices 
within the top 1% participate in the vast majority of the 
triangles. 

In high-C networks, the triangles incident to low degree 
vertices are mostly connecting to two low degree vertices. 
On the other hand, in low-C networks (particularly in the 
low density networks), a significant portion of the triangles 



contain at least one high degree vertex. 

To set a threshold between low degrees and high degrees, 
we have experimented different percentiles of vertices that 
participate in at least one triangle. In Table 1, Atgo, Atgg, and 
the maximum degree of each network are presented. It is 
interesting that the gap between /^go and /^gg is quite large 
in low-C networks. We pick kqq as a threshold, since /^go 
is still relatively low compared to the maximum degree in 
most networks. A degree of a triangle vertex is considered 
low^ if the degree is no greater than kqq , high otherwise it is 
considered. 

We look at the percentages of the triangles having at least 
one high degree node in Figure 4. In low-C networks, we can 
say that high degree vertices within the top 1% participate 
in most of the triangles. In high-C^ high degree nodes are 
participating in less triangles except soc-Epinions network. 
Soc-Epinions has a very low clustering coefficient value (i.e., 
less than 0.1) and it is a border case. Hence, its statistics 




Figure 4: The percentage of triangles produced by vertices 
in the Top 1% degrees 



are higher than the other high-C networks. 

3. GRAPH-MODEL TRIANGLE BEHAVIOR 

In this section, we investigate how well random graph gen- 
erators match the real graphs in terms of triangle degree 
similarity. We concentrate on the graph models generating 
heavy-tailed degree distributions. 

3.1 Graph Models 

Barabasi- Albert [1] proposed the Preferential Attachment 
(PA) model is often associated with the "rich get richer" 
concept. In the PA model, a new node connects to a pre- 
specified number of vertices, where the likelihood of choosing 
a vertex is proportional to its degree. This procedure leads 
to graphs with power-law degree distributions. As Sala et 
al. [29] pointed out, PA model cannot create communities 
in the graph and cannot generate high clustering coefficients 
for low degree nodes. 

The Stochastic Kronecker Graph (SKG) model [19] starts 
with a basis matrix (typically 2x2), and generates a ma- 
trix that specifies edge probabilities by repeated Kronecker 
products. A noise-added version of SKG has been proven to 
generate log-normal degree distributions [31], but the clus- 
tering coefficients of SKG graphs are very low [26] . 

The Chung-Lu (CL) model [8] can be considered as pick- 
ing a random graph among all graphs with the same degree 
distribution. In this model, the probability of an edge is pro- 
portional to the product of the degrees of its endpoints, (i.e., 
Pr (dj) — ^2m)'^ ) • Piii^^ 6t ^1- showed that many properties 
of graphs generated by SKG and CL models are similar [26] . 

The Block Two-Level Erdos-Renyi (BTER) model [30] is 
built on the observation of high-clustering coefficients and 
skewed degree distributions. This model achieves high clus- 
tering coefficients by embedding communities with an Erdos- 
Renyi structure, which is typically much denser compared 
to the rest of the graph. Edges are added in a subsequent 
phase using the CL model, to satisfy the degree distribution 
requirements. It has been shown that BTER graphs can 
match many properties of real world graphs [30] . 

The Edge Copying (EC) model [16] imitates the topic- 
based communities in the Web, and generates an evolving 
directed graph. When a new node arrives, it selects a ran- 
dom vertex and copies a specified number of links. For the 
EC model, both the in-degree and out-degree distributions 
follow power laws. However, the model does not create back 



links and does not generate many triangles. 

The Forest Fire (FF) model [20] combines the PA model [1] 
to obtain a heavy-tailed degree distribution, the EC model 
[16] to obtain communities, and community guided attach- 
ment for densification. The model has a forward burning pa- 
rameter p. In the FF model, a node v arrives and chooses an 
ambassador node w randomly and connected to each neigh- 
bor of w with probability p. This process is repeated for 
each new vertex v connects to. Note that EC model only 
copies the links of a node, it does not hop to the neighbor 
of the node. 

3.2 Fitting Graph Models to Real Networks 

To check whether graph models can reproduce the trian- 
gle degree behavior of the real networks, we fit FF, BTER, 
EC, CL, PA, and SKG models to the real networks listed in 
Table 1. 

The inputs to the The PA model are the number of nodes 
n and the number of edges k created by each node. We pick 
k by rounding the density of the graph in order to match 
the number of edges in the real networks. 

The BTER model takes the degree distribution of the real 
networks. The connectivity per block is computed by a table 
look-up on the average clustering coefficient per degree Cd 
plot. 

For the Chung Lu model, we provide the degree distribu- 
tion of the real networks as an input to the model. 

For the Forest Fire model, we provide the number of nodes 
n and the forward burning probability p. We match the 
generated graph models to the number of edges in the real 
networks. For each target graph, we search a range of values 
by incrementing p value by (^p = 0.001 in range [0-1] to find 
the best model giving the similar number of edges to the 
original network. 

The EC model takes the number of nodes n, number of 
edges each node creates /c, and the edge copying probability 
p which is calculated based on power-law degree distribution. 
Power- law degree slope is jz^- We pick k by rounding up 
the density of graph. 

To generate the SKG model, we compute the parameters 
of the initiator matrix using the Kronfit algorithm [18]. The 
size of the final adjacency matrix is 2^^°^*^^-*^ where n is the 
number of nodes in the real graph. 

Before running our experiments, we symmetrize by treat- 
ing each edge as undirected and remove self-links for all the 
generated graphs. 

3.3 Triangle Analysis in Graph Models 

After fitting different graph models on the real networks, 
we enumerate triangles in each randomly generated network 
using Cohen's algorithm [10]. We analyze the triangle be- 
haviors in these random graphs in different aspects. 

The numbers of triangles: None of the graph models 
capture the triangle numbers for both high-C and low-C 
networks. 

Some models are good at generating similar number of 
triangles for high-C networks, some of them are good at for 
low-C networks, but none of them is good at both. The 
number of triangles generated by different graph models for 
each target graph is listed in Table 3. Graph models be- 
have differently in high-C and low-C networks in terms of 
generating triangles. 



Table 3: The number of triangles generated by graph models 
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In high-C networks, BTER matches the number of trian- 
gles in the original graph better than the rest of the mod- 
els. FF creates significantly more triangles than the original 
number of triangles. CL generates significantly less triangles 
than the original number of triangles. For networks with the 
high clustering coefficient C and high density p, CL is not a 
good choice. 

In low-C networks, BTER generates more triangles than 
the original number of triangles. As a matter of fact, when 
the C value is very small, BTER generates even more trian- 
gles than the original number of triangles. Particularly for 
wiki-Talk, it generates many more triangles. CL is also gen- 
erating many more triangles for wiki-Talk. CL is generating 
less triangles for web-Stanford but it generates similar num- 
ber of triangles for oregonl_010331 and as-caida20071105. 
FF generates similar numbers of triangles as in the original 
networks except wiki-Talk. It seems that none of the models 
can reproduce the structure of the wiki-Talk graph. 

EC, PA, and SKG generate significantly less triangles than 
the original triangle numbers. These models also cannot 
reach the average clustering coefficient per degree for any 
of the network. Therefore, we will not include them for the 
rest of the plots. 

Degree Relations: Models generate either homogenous 
or heterogonous triangles for graphs. In Figure 5, we show 

the relation between di (i) and d2{i) and in Figure 6, we show 
the relation between di (i) vs ds (i) for the real graphs as well 
as their modeled counterparts. 

CL produces heterogeneous triangles for both high-C and 
low-C networks in both Figure 5 and Figure 6. For low-C 
networks, it is very intriguing that CL graphs are generating 
the right type of triangles, since the di{i) vs d2{i) and di(i) 
vs d3(i) plots follow the true graph quite well. But we feel 
that this indicates that low-C networks have a CL flavor to 
them (i.e., triangles are random). 

BTER generates homogeneous triangles for both high-C 
and low-C networks. For high-C networks, BTER matches 
the di (i) vs d2 (i) but generates lower ds (i) values than orig- 
inal d3{i) values. 

FF behaves like BTER for low-C networks in di (i) vs d2 (i) 
and di{i) vs dsii) plots. Low degree di{i) values cannot con- 
nect to high degree vertices. For high-C networks, there is a 
steep increase after di{i) reaches 10 in some of the networks 
in both di{i) vs d2{i) and di{i) vs dsii) plots. Distance 
between FF's d2{i) and original d2{i) and FF's (^3(2) and 
original dsii) is considerable large. FF also reaches higher 
di(i) values than the original di{i) values. 



Distance Relations: Some models cannot provide a gap 
between minimum, medium, and maximum degrees of trian- 
gles. 

We provide the degree-comparison plots of generated mod- 
els in Figure 7 for one high-C and one low-C network as 
representative. The other networks in high-C and low-C 
networks have the similar degree-comparison plots. 

For high-C networks, BTER cannot provide the gap among 
minimum, middle, and maximum degrees. Having the al- 
most same degree for all di(i), d2{i)^ and (^3(2) conflicts 
our Observation 2. FF provides a distance between min- 
imum, middle, and maximum degrees, however, d2(i) and 
ds (i) strongly deviate from di (i) in the middle (it generates 
a bump) . CL also provides the gap among degrees but both 
d2{i) and ds^i) strongly deviates from di(i) at the beginning. 

For low-C networks, BTER does not generate distance at 
the beginning again but in the middle it jumps to high d3{i) 
values. FF provides the gap but it has a bump in the middle 
again. CL provides much wider gap between d2{i) and dsii). 

It is very clear in Figure 7 that CL generates heterogonous 
triangles in any network, and FF and BTER generate more 
homogenous triangles and none of the graph models exactly 
match the triangle behavior in the original network. 

The number of triangles per di{i): None of the models 
obtain the number of triangles per di{i) for both high-C and 
low-C networks. 

To understand, how triangles are distributed among de- 
grees, we analyze the number of triangles generated per min- 
imum degree di{i) for different graph models in Figure 8. 
Models behave differently for high-C and low-C networks 
again. 

For high-C networks, BTER model matches the original 
network behavior very well except Soc-Epinions (which is in 
the border of low-C). FF generates way too many triangles 
for high degree nodes except Soc-Epinions (it produces less 
triangles for this graph) . CL is generating consistently lower 
number of triangles than the original number per minimum 
degree di{i). For Soc-Epinions, CL generates slightly higher 
triangles but still less than the original. It is observable that 
when C decreases, CL starts to generate comparably more 
triangles. In high-C networks, FF reaches much higher di{i) 
values (except Soc-Epinions). BTER and CL are reaching 
the similar di{i) values to the real di{i) . 

For low-C networks, none of the graph model matches to 
the original network behavior. BTER is generating more 
triangles for high degree nodes. CL is catching the similar 
behavior in Figure 8e and in Figure 8f but for the others it 
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Figure 5: Triangle degree relations between di(i) and d2{i) in the generated graph models 
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Figure 6: Triangle degree relations between di{i) and d3{i) in the generated graph models 
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Figure 7: Triangle degree- comparison plots for the randomly generated graphs 



misses as well. FF tends to only generate triangles with rela- 
tively low degree vertices since the graphs are very sparse in 
low-C networks. Hence, the plot for FF ends very early. In 
low-C networks, FF reaches only one- tenth of the maximum 
di{i) value of the original networks (except web-Stanford). 

Clustering coefficient plots: The clustering coefficient 
behavior of models differ for high-C and low-C networks. 

When we analyze average local clustering coefficient iCd) 
per degree in graphs models as plotted in Figure 9, we can see 
the similar trends as in Figure 8. Clustering coefficient plots 
confirm that low degree vertices have high local clustering 
coefficient. 

For high-C networks, BTER matchs the local clustering 
coefficient behavior of the real graphs fairly well (in fact, 
the Cd values are used as input to the model). FF is not 
matching perfectly but generates high Cd values. CL is fail- 
ing to reach high clustering coefficient values per degrees for 
high-C networks but it has slightly better averages for low-C 
networks. 

For low-C networks, BTER deviates from the original Cd 
values in the middle. FF is not matching perfectly again but 
getting close. CL is generating higher Cd averages than it 
generated for high-C networks. 

4. CONCLUSIONS 

We believe that the dichotomy between homogeneous tri- 
angles and heterogeneous triangles is quite useful in char- 
acterizing graphs. This also gives a quantifiable method to 
distinguish the underlying processes creating these graphs. 
Social scientists and physicists have long tried to explain the 
behavior of humans (or appropriate agents) based on topo- 
logical structure. The degree-labeled triangles appears to 
give us a window into this behavior. 

The degree-labeled triangle behavior yields fascinating in- 



sight into the structure of real- world networks. As expected 
the triangles come in various types in different networks. 
However, our studies showed that the global clustering co- 
efficient is a good indicator of what kind of triangles the 
graph contains. High clustering coefficients (> 0.01) imply 
homogenous triangles (i.e., degrees of the vertices are close), 
while low clustering coefficients is a sign of heterogenous 
triangles (i.e., significant variance among the degrees of the 
vertices) . 

We have also investigated whether the current graph mod- 
els can regenerate the types of triangles in the real data. The 
results showed that while some models are good at match- 
ing the total number of triangles in the real graph, no model 
can match the types of triangles for both the high and low 
clustering coefficient graphs together. Our paper shows that 
community structures in graph models are not able to cap- 
ture the community behaviors in the real networks. There- 
fore, there is a room to improve the existent models and 
design more realistic graph models which support triangle 
degree behaviors. 
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